|
|
Accession Number |
TCMCG075C20040 |
gbkey |
CDS |
Protein Id |
XP_017978666.1 |
Location |
join(22320046..22320106,22320411..22320508,22320613..22320842,22321735..22321902,22323402..22323762,22323836..22323979,22325140..22325292,22325395..22325511,22325927..22326007,22326264..22326395,22327006..22328766,22329098..22329136,22329316..22329408) |
Gene |
LOC18596566 |
GeneID |
18596566 |
Organism |
Theobroma cacao |
|
|
Length |
1145aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA341501 |
db_source |
XM_018123177.1
|
Definition |
PREDICTED: uncharacterized protein LOC18596566 isoform X2 [Theobroma cacao] |
CDS: ATGGAGGTACAAATTTCAACAACTGAAACAGAGTCGCATCCTGAAACGGGAGATTCTTTAGGTTTTGATGAGGTGAAACTTAAGGAATTTATTGCTCAAGGAAAATTAGACTTTGATGGATGGACTTCCCTTATTTTAGACGTTGAGAATTCATTTCATGATAAAATAGAGAAAATATGTTTAGTTTATGATTCCTTCTTGTTTGAGTTTCCTTTGTGCTATGGTTACTGGAGGAAGTATGCTGATCACGTGATACGTTTATGCACCATTGACAAAGCTGTTGAAGTCTTTGAACGAGCTGTGCAATCAGCAACATACTCTGTTGGTGTTTGGGTTGACTATTGTGGCTTTGCTATTTCAGTTTTTGAAGATGCCAATGATATTCGTAGACTATTTAGAAGAGCCATGTCCTTCATTGGAAAGGATTACTTATGTCATACCTTGTGGGATAAGTATCTTGAATTTGAGTTTTCTCAGCAGCAGTGGAGCTCCTTAGCTAATGTTTACATCCGAACTTTGAGATTTCCTTCGAAAAAGTTACATCGTTATTATGAGGGTTTCCAGAAGCTGGCTGCCACTTGGAAAGAAGAGATGCAATGTCTGAATGATTTGGATTTAGAATCAGATCCCAAAGTAGAAAATGAAGTCTCTACATGTTATACTGATGATGAAATTTCTTGTGTCATCAAAGACTTGCTGGATCCTTCCACTGGTGTGGATGGTACTAAAGCACTGGAAAAGTATCTGTCAATTGGGAAGCAATTCTACCAGGAAGCGTCTCAGTTGGGTGAAAAAATACATCGCTTTGAGACTAGCATTCGAAGGCCTTATTTTCATGTAAATCCACTTGACATCACTCAATTGGAGAACTGGCAAGAGTATTTAAACTTTGTAGAGATGCATGGCGATTTTGATTGGGCTGTTAAACTTTATGAGAGATGCCTAATTCCATGTGCTAATTATCCTGAGTTCTGGATGCGTTATGTGGATTACGTGGAAAGTAAGGGTGGAAGAGAGATAGCAAACTTTGCATTGGCCCGAGCAACACAAATTTTTCTGAAGAGAATGCCGGTGATCCACCTTTTCAGTGCTAGGTTTAAGGAGAAAATAAGGGATGTTTCAGGTGCACATGTAGCACTTGCTGAATATGAGACAGAATCAGATCTCAGCTTTGTTGAAACTGTATCAATAAAGGCTAACATGGAAAAACGTTTGGGCAATTTTGTAGCAGCTTCTAATATATACAAAGAGGCAGTGGAAATTGCTGCTGCAAAGGAAAAGTTTGACATTCTTCCCATATTGTATATTCATTTTTCTCGACTTCAATACATGATTACCAGTAAGAGTGATGCTGCCAGAGACATCTTAATAGATGGTATCAAACATGTGCCTCATTCCAAATTGCTTTTAGAGTTTGGAATGATGCATGGAGGGCATACGCACATACATGTGTTAGATGCTATAATAGATAATGCAATATCTCCAGGGCTCTCTCAAGGTATGAATGCAGAAGAGGCGGAGGATGTATCAAGCTTATATTTACAGTTTGTTGATCTTTGTGGGACCATAGATGATATAAGAAGAGCATTAAACCGGCACATAAAATGTTTTCCTGGCTCCACAAGGATGAGCACGTATATGTTCTCAGTTAATGGTATAAAACCTATACCTTTGAAGATGACATCTGGCCGAAGACAAGAAAGCCTTGGTGCTTTGCCTTCCCATCCATCTGGAGGTGGAAGTTTGGACGTCCCAACTCAGTCACTATCTCTAGACAAGATAATGAAGTCTCCAGAAAATGATGATACCCAGCGTAATCATGCTGCCTTGGACTGGGTTTTGGACAAGAAATCACCAAGGCAGGAAAATCATGAAATTCCTTCTGACCAGGCTACCGTCAACAGGCTTCAATCAGAGGTTGATGAAAGTTTGCAGGAGGGAATGCAGCAAGGTTCTGAAGATGTTTCAAAGCAGCTAAGAGAAGATATAAAAGCTAATACAAATTTGTCATCTCCTGATTTAATACATGAAGTGACAAATGAAGTTGAAGCACTACAAACTTCAGAAGAAAATTCTAAAGAAAATGATATCAAGCAAGAGCATGATCATAAGTCGGAACAGGATTTAAACCAACTCTCACTGGAGAGACTTTCATTAGATCATCTAGATCATAAGTGTTCAGATTCAATCAGAGTCGCAAATCAGGAAGGTGAAACTTTTGTAGAAACCAGGTTATCAAATGGAAGCATGGTGAAAAAAGAACCTCCTCAAGAAACCAGCATGTGCTATGGAAGCGTGCCAGAGGGTGGTCAAAGTAATGATGGACATCACCTGGTATCCAGCCCAAGGAGTGCTCAAGCATCTGATTCTGCTGGAATTCAAACTGAAATGGCCAGTCCTTCATCTTCAGCAAGTCAGCAGAACATCAAGAAAACAGAGCCGCCTTTACGGAGGACACCTCCTTATGGTGGTGGAAGTTGGCATCAAAGGAGTAAAGCTGACAGAGTTCATAGAGAAAACAAACTTGGATTCCGAAGGCATTCTCATAAAAGGCTGCAGCAAAGGCAGCAGGTGTCCCCACAAAGGCTGTGTCCACGAAGTGACACTGGCACACAAGTGCCCATGAGTCAAGGTTACCCCAGTCAACCAATGTCTTGGCAGAGTCCACAAGTTCAACAGGGTGGCCAAACACAGAGTCAGTACTCCACATCTGCTGCTCATCCTAATCTAATAACAGCCCATGGCTGGTCTATGCACAACATGCAACTACAGAATTTTGTCCCTAGTCAGTCTCAAGTACTTCCTCAACCTGCTCATCCTCCACCACAGATCTCTCAACATCCCATGCAAAGTAATGAGCAGCTTGGGCAAATGCAGAATAACCAAGCATATAATCAAATGTGGCAATATTATTTCTACCAACAGCAACAGCAGCATCCATTTCTTTTGCAGCAACCCCATAATCAACAGCCTCAGCCCCAGCAGCAGCTTTTACAGCAACAGTATCAGCAACATCAGCAGATGCTACAAGTGCAACAGCAACAATTACTCTATCAGCACCCACAGCTACTGCAGCTTGAGCAGCAACACCAATTTGTTCAACATCAGCAGCAACAGTATCTGCAACAGCAGCAGCAGCTGATGCAAGAGCAACAACTACAACAGCAAGGCTCTTATCTGCAACAACTTCCACCACAAAATCATCATCTTTTCTTGCAGCAGCAGCAGCAAGAACAAGAGCAAAGACAACAAGAAGAGCAGATTGCAACATCACAGGTTCAGACATTGAATGACTCAAGCAAAGAGGAATCCATGATGGAAACAAGGGTACAGACAAGATTGCAGGGCCAAGGTACATTGTCTCATGGGACCGATGCGTCTAAAACTGTATCATCTGCTGCATCCCCAAATTCCAAGCAAAGATCTTATTCAAGCTAA |
Protein: MEVQISTTETESHPETGDSLGFDEVKLKEFIAQGKLDFDGWTSLILDVENSFHDKIEKICLVYDSFLFEFPLCYGYWRKYADHVIRLCTIDKAVEVFERAVQSATYSVGVWVDYCGFAISVFEDANDIRRLFRRAMSFIGKDYLCHTLWDKYLEFEFSQQQWSSLANVYIRTLRFPSKKLHRYYEGFQKLAATWKEEMQCLNDLDLESDPKVENEVSTCYTDDEISCVIKDLLDPSTGVDGTKALEKYLSIGKQFYQEASQLGEKIHRFETSIRRPYFHVNPLDITQLENWQEYLNFVEMHGDFDWAVKLYERCLIPCANYPEFWMRYVDYVESKGGREIANFALARATQIFLKRMPVIHLFSARFKEKIRDVSGAHVALAEYETESDLSFVETVSIKANMEKRLGNFVAASNIYKEAVEIAAAKEKFDILPILYIHFSRLQYMITSKSDAARDILIDGIKHVPHSKLLLEFGMMHGGHTHIHVLDAIIDNAISPGLSQGMNAEEAEDVSSLYLQFVDLCGTIDDIRRALNRHIKCFPGSTRMSTYMFSVNGIKPIPLKMTSGRRQESLGALPSHPSGGGSLDVPTQSLSLDKIMKSPENDDTQRNHAALDWVLDKKSPRQENHEIPSDQATVNRLQSEVDESLQEGMQQGSEDVSKQLREDIKANTNLSSPDLIHEVTNEVEALQTSEENSKENDIKQEHDHKSEQDLNQLSLERLSLDHLDHKCSDSIRVANQEGETFVETRLSNGSMVKKEPPQETSMCYGSVPEGGQSNDGHHLVSSPRSAQASDSAGIQTEMASPSSSASQQNIKKTEPPLRRTPPYGGGSWHQRSKADRVHRENKLGFRRHSHKRLQQRQQVSPQRLCPRSDTGTQVPMSQGYPSQPMSWQSPQVQQGGQTQSQYSTSAAHPNLITAHGWSMHNMQLQNFVPSQSQVLPQPAHPPPQISQHPMQSNEQLGQMQNNQAYNQMWQYYFYQQQQQHPFLLQQPHNQQPQPQQQLLQQQYQQHQQMLQVQQQQLLYQHPQLLQLEQQHQFVQHQQQQYLQQQQQLMQEQQLQQQGSYLQQLPPQNHHLFLQQQQQEQEQRQQEEQIATSQVQTLNDSSKEESMMETRVQTRLQGQGTLSHGTDASKTVSSAASPNSKQRSYSS |